CDS

Accession Number TCMCG036C11497
gbkey CDS
Protein Id PTQ39441.1
Location "complement(join(853327..853395,853519..853686,854338..854557,854689..854751,855024..855154,855266..855442,855542..855716,855845..855916,856082..856190,856296..856371,856521..856571,856667..856748,856866..857081,857440..857559,857652..857803,857958..858144,858313..860803,860987..861242,861344..861498,861660..861930))"
GeneID Phytozome:Mapoly0045s0096
Organism Marchantia polymorpha
locus_tag MARPO_0045s0096

Protein

Length 1746aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772717.1
Definition hypothetical protein MARPO_0045s0096 [Marchantia polymorpha]
Locus_tag MARPO_0045s0096

EGGNOG-MAPPER Annotation

COG_category -
Description -
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCTTCCCGCCTCGAAAGTAAGGAAGTGCAAATTTGGGAATCTCCCCCCAAGAGCCAGACGAGGATGAAGCTCAGCCAGGATTCAAATCCAAATTCCCGGATGTGGAGCTATGGAGTTGTTGAAACGAGATATTCGTTCATTTCTACCGTCACACCTATCGAAACTACCGGAGAGAGATTTGCACCTAGCCTTCAGATTCTGAAGGACCAGCTTGATGCCGAGACGAAGAAGTATCCACCTTCACAAGATGCTGCACCGGCTTGTGCAGGTTCGAAACGTAGGCTTGAACCTACACACGAGCAAGATTATGTATCCTCAGACGGGTCCGGCAATTGCGCCAAGATGAAATCGTCTATGTCGAAGCCGCCGGCATTCAAAGCATCCACTATTCGAACGAGTACACAGCGCAGAGTGCGATTCTTGCCCGACGCGAGGGCCGATGAGCAGGCGGTCGAGGGTTCAGCGTCCCAGGAGGAAGTAGAAGAGCACACGTTTCCGGTTTTGAAAAAAGGCAGGTTGAGTTTTGGCGGCAGTGACGTCATCCAGGGCCTCTTGGCAAGACGAGAATCAGCACCTGGCCCCGCGTCCATTTGTGGCGAACGGGAGCTCAGTCAGATTGCTCGGCCACGAGCACTGATCGTCACCCCAAAAGTGCCCGAGGAAGTGGCCCCTGGCAAAGGAGTGAGGACGACGGAGCTGAAATTCGACCCTCAAGTACGCATGTCTGACGAAACGAGGCAGAATGGTTCACTCATGACACCGAACTCTGGGACCACATCCTCTGTTAACGATTGCGAGGCACAGACGAAGGAAAGCAGGTACGGTACATTGCCGGTGACTTTGAAGCGGCCTGAGCTCGTATCAAGCGTCGAGACGCTAAACAAGAAAGCTAAAAGAAATGCTGACCAGTCAATTGCGGCCTTGTGGCCAACAGCAGGCCCTGGTTGTGCAGGACCCGTTCGCTTCCCAGACGAGTCGGAGGGGACGCTTCCGTCTCCAGTTGTACCATATTGTGAGTCTCGCACGAGACGTCAAGTGCCTGCAGTTTTCGAGGATAGTTTGGTGCAAGATTCGCAGGACGACTCTGAGTCCGAAGGCTTTCCCGTAGCTCAAAATTCTTCTATCAAAGGTTGCCCCAAGCTTGGCGACATTAGTCCGCAGTCGTTCGTCAGAAGCTCTCTTCCAGTTCGACCGCCCTGCGATCAAAATGTGGAGAACAGGGTCGAATCTGTTGGCAATGCGGAGAGAACTCACTTCGACGAAAGACCACAGAGGGAAGCAGTGGACAGAGGATGCGCCACTGACGGAGTGTTTCGACGACCGGAAGGTGCGAAATTTAGCCGCTCTTCGTGTACAGGAAGTCGGGCGGAGTCCTTGGACGGACTTGGGGCACAAGTTACTGTCAATGATTTTTATCCCGGACAGGAGAAACGAGGTGAATGTAGGGGCCCGATGCCCCCGAATGGCGAGGGCGCTGCTTGTGATTCATCATCTATCCGAGGATATGACGCTATGCCCATAAATAATGAGGAGTTGTGGCGTCTTGGGCCTCCGGAATTCGTTTCGTCCAGACCAGCCCATGGATGTGGGAAGGTCCAAGAGCTTGAGCTATTTTCGAATGCTTTCCGGACGGGCAAGGGAAATCCTGTTCAAATTTCTGCATCAGCGATTAGCAGAGTCGCATCATTATTTGAAGATGTGTCTCCAGGTCGTGAATCCCGGGCGAATGTCTTCACGCCTCCAAAATGTGCCGCAAGAAATAAAACTGTTTCCTCTTCTCCAGCTCTCGCTGATGCCCATTCTCTTCAATCTGTGAGGAAAAATCTGTATCACAGCTCCCTGTCAGTGCAAGATGTCGATTGTCCAGAAATCTCCCTCGTACATCAAGCTCGAAAAGCTGGCACGCAGGGCAAAACGGCATCACTTTTTCAAACTGCAAGGGGATTACCTGTCCATGTCTCTGCTGAGAGTATACAGAGAGTGCAACCAATTTTTTCAACACCTGAAACTTTACCATCGAAGGAAGATGTGATGATCCTTAAACCCTCTGTAGAAGACGAGGGATTAGCTCACAATTCACGTTCTGGAGCTCTTTCTTCTCGTGCAGTCAAAGGAAGAGCAAATCCTTCCATCATAGGAGGGGTGAACGACGGGGCCATAGGGCGACGAGATTCCTCTGGGACTGGAAGTGCTGGCGCGTCGAGGTCAAATTTATTCCGGACAGCGAGAGATACACCTGTAACAATCTCATCTGCAGCGCTGCAAAAGATTTTAAACATTTTTGAAGACGAAGATCTTACCCCACCACCGAGCAGGGAAGGGGTCGGTGCCATACCAAGAGATCTTGAGGAGAAGAGTGCTACCTCTATTCCTTCTACCTTGCCTGCAGCAGGATTCCTCGTACCCGTGAAGAATCAGGACACAAGTCGGGAGCTTCTCTCAGGATGTCGTCATGATGCTAATTCACTCCCTCGAGTTATCCCGCCCGAAGTAACAACACTTCACGGAACAACTCCAATGCGCGGGTTTAGGACGGGGTTTTCCACAGATCATCCTGAATTTACGGAAAATGCCGAGCCGAAGTCTAAACTGTCTCCCGGGGAGGTTGTGCTTCGTGAAAGAGCGAATTACATTCGATATCAGTGGGATAGAATCTTTGAGCCCCGGCTGGGGTTTAGCCTTGAATCTCGTGCCAACCGCGACATCTCTGGGTCATTTGCTGTTGAAGCCGGGCTTGGAAAATCGGCAGCTCTCAATCATGCAACGATTGAGGAGAGTCCCTACTTAGCGGGTTCCCGTCACGCTGAGGTTCGAAAGGATGCATCGCTGGAGGGAGGCAACGGGAGTTTCAGTTTCAAGGGTAGTTCCGGACGCACATTGACAATCTCGAGCGCAGCCAAAGCGAAAGCGGAAGCTCTTTTGAAGCTTGGTCCTGAATTTGTGACCCCGCCGAAGTTTGAAGCACGCAAACCTTCATTTGAGTTCAGAACTAAGGCCTCAAAATCAGATGCGCGGTCGTTGCTAGAAGTTGAAGAACGCAAACAGTATACGAGTGCTGTCAACCATGGTCGCAAGCAGGCCATCAACGATGTAGACAAAAGAGCACAGGAGAAGAGCGAGTCTGGAAGCGGGCGTGGGTCAAGGGCTTTCAAGGCGCCAAGAATACTCAGGCCCTCACTAAGCTTGCCTGATCGAAGAAAGTACATTGGTTTCCAACCCGGCATTGTAACTGCTGCACGGCGGAACTTTATTTTACCTGATGGTTTACATTACCTTCACGAAAAGAAATCACGAATAAAACGAGGTCGATTGCAGTTGAGTGAATACTTTGGTGGACCTCCTCATCCGGGTGCCAAGACTCTCTCCAATTTGTCGAAGGAAGTTTTGTCAATAACTGCAGACACAGCTGAGACTTATAGAATTCCTGACGGAGGAACGGGTGTTAGTGTGGACGAAGTTTGGATTATGCTCAAAGACTTGGGAGCTGATCAACGGTACGCCAGCAAAGGGTGGGTGGCTAACCACTTCAAATGGATTGTGTGGAAATTGGCGTCATACGACCGTAGATTTCCTCGTCCGAAGGCCTGTCTGCTGACATTATCGTCTGTGTTGAATGAGCTCCGGTATAGATACGATAGGGAGTTGAACGGGGAACGATCAGCTCTGAAACGAGTGACAGAGAGAGATTCTTCTGCAGGTCACACTATGGTCTTGTGTGTGTCAGCAGTTCGGAGTGTTAAGGACATTGAACATTTTTATGAAATTTGGATCGAGAATAGAGATAACGCTGAAGGAAATAAGGCCAAGCTTCCTCCAGCGGGCGTCATCGAAGTAACTGATGGATGGTATAGACTGAATGCGCATGTTGATGTACCTCTTGCAAGGCGCATTCACAGTGGAAAACTCAAGGCTGGACAGAAAATCGTGGTATGTGGAGCAACTTTGGAAGGTCTTTCTGATGGATTACCACCTCTTCAGGCGTTTGATAATGCTTATCTGTGTATCAACTCAAATGGCGTTCGTAGGGCCCGATGGGACCAGCGTTTGGGTTTCTGTGGGACAGCAACACCGCTTGCGTTCAAGTGCATTACACACGATGGTGGAGTGATTCCCCACACTTATGTTGCCATAACACGGAGATACCCCTTACTTCATAGAGAGAAACTCGCTGGTGGTGGTTATGTTTTGAGATCTGAGCGTGCAGAAGACCTGGCTGCACATGATTTTGATAAAAGGCGTGCCAAAGTGGCGGAAGAGGCAATTCAGAAGCTAGAGGATAGGGACAGCTCCTCAGATGAAGGAGCAAAGTTATTTTTTGCACTGGAAACAACAACCGATCCTGAAAGCCTCCTGCTCGGCATGACACGCTCTCAATTAGATGCTTTTGCAGCCTTCAAAGCTCGAAGAGAGGAGTCCAGGCAAGTTTTGACAGGAGACCTGATCAGGAATGCACTTGAGCTTCATGGTCTGGCATCGCGAGAAGTAACACCAACCTTCAAAGTTCGTGTAGTAGGATTAAATCGCAAAGGAGACCCAGGATCAGACAAGGACGGTCTGGTTACAATCTGGCATGCTACTGAAGAGCTGCAAGAGCAGCTACAAGAAGGGAAGGTGTTTTGCGTCTACGGGCTGCGTCCATCCACGAAAGGATCGTATCATTCATTTAACTCAACAAGATGGCAACCGGTGTCGTTTTCAACTCTGGAAAAACATTTTGAGTTTCCGTACGTTCCGCGTTTTCCTCTGCCAATATCCAATCTCAGCGAATATATGCCGTGCAGAGAATTCGATGTTGTTGGGTTGGTACTTTTGGTAACGGATCCCGTCACTAGGGACACATTTCGTGGAAGAGCAAAGTCTCAATGGGCATTCATGACGGATGGCTCTTGTGCCAAAGGTGATGCCGCGGACGCGTGGGAGGAAACTCAGGTCCTTGCGATTGAAATCAGCTGGCCCGAGCAAGCATTTGTTTCGTTTGAAAAAACCCTCGTAGGTTCCGTGGTTGGTTACTGTAACCTGTGGTTGAAGCACAGAGATCAAAGCAACCGCCTGTGGGTAGCTGAAGCAACTGAGAGATCTCACTACTCAAGCAAAATGACTTCACCTGCATTTCGCCACCTCACTGGTGCAGCTCTCGAAGTACAAAAGTGGTCAAAACTCAATTCTTGGAATGTAGATATTCTCCAATCTGATGTGGAGCAACTGGCAGTACACTTTGAAAACCCGTCAAGCGCTTAG
Protein:  
MSSRLESKEVQIWESPPKSQTRMKLSQDSNPNSRMWSYGVVETRYSFISTVTPIETTGERFAPSLQILKDQLDAETKKYPPSQDAAPACAGSKRRLEPTHEQDYVSSDGSGNCAKMKSSMSKPPAFKASTIRTSTQRRVRFLPDARADEQAVEGSASQEEVEEHTFPVLKKGRLSFGGSDVIQGLLARRESAPGPASICGERELSQIARPRALIVTPKVPEEVAPGKGVRTTELKFDPQVRMSDETRQNGSLMTPNSGTTSSVNDCEAQTKESRYGTLPVTLKRPELVSSVETLNKKAKRNADQSIAALWPTAGPGCAGPVRFPDESEGTLPSPVVPYCESRTRRQVPAVFEDSLVQDSQDDSESEGFPVAQNSSIKGCPKLGDISPQSFVRSSLPVRPPCDQNVENRVESVGNAERTHFDERPQREAVDRGCATDGVFRRPEGAKFSRSSCTGSRAESLDGLGAQVTVNDFYPGQEKRGECRGPMPPNGEGAACDSSSIRGYDAMPINNEELWRLGPPEFVSSRPAHGCGKVQELELFSNAFRTGKGNPVQISASAISRVASLFEDVSPGRESRANVFTPPKCAARNKTVSSSPALADAHSLQSVRKNLYHSSLSVQDVDCPEISLVHQARKAGTQGKTASLFQTARGLPVHVSAESIQRVQPIFSTPETLPSKEDVMILKPSVEDEGLAHNSRSGALSSRAVKGRANPSIIGGVNDGAIGRRDSSGTGSAGASRSNLFRTARDTPVTISSAALQKILNIFEDEDLTPPPSREGVGAIPRDLEEKSATSIPSTLPAAGFLVPVKNQDTSRELLSGCRHDANSLPRVIPPEVTTLHGTTPMRGFRTGFSTDHPEFTENAEPKSKLSPGEVVLRERANYIRYQWDRIFEPRLGFSLESRANRDISGSFAVEAGLGKSAALNHATIEESPYLAGSRHAEVRKDASLEGGNGSFSFKGSSGRTLTISSAAKAKAEALLKLGPEFVTPPKFEARKPSFEFRTKASKSDARSLLEVEERKQYTSAVNHGRKQAINDVDKRAQEKSESGSGRGSRAFKAPRILRPSLSLPDRRKYIGFQPGIVTAARRNFILPDGLHYLHEKKSRIKRGRLQLSEYFGGPPHPGAKTLSNLSKEVLSITADTAETYRIPDGGTGVSVDEVWIMLKDLGADQRYASKGWVANHFKWIVWKLASYDRRFPRPKACLLTLSSVLNELRYRYDRELNGERSALKRVTERDSSAGHTMVLCVSAVRSVKDIEHFYEIWIENRDNAEGNKAKLPPAGVIEVTDGWYRLNAHVDVPLARRIHSGKLKAGQKIVVCGATLEGLSDGLPPLQAFDNAYLCINSNGVRRARWDQRLGFCGTATPLAFKCITHDGGVIPHTYVAITRRYPLLHREKLAGGGYVLRSERAEDLAAHDFDKRRAKVAEEAIQKLEDRDSSSDEGAKLFFALETTTDPESLLLGMTRSQLDAFAAFKARREESRQVLTGDLIRNALELHGLASREVTPTFKVRVVGLNRKGDPGSDKDGLVTIWHATEELQEQLQEGKVFCVYGLRPSTKGSYHSFNSTRWQPVSFSTLEKHFEFPYVPRFPLPISNLSEYMPCREFDVVGLVLLVTDPVTRDTFRGRAKSQWAFMTDGSCAKGDAADAWEETQVLAIEISWPEQAFVSFEKTLVGSVVGYCNLWLKHRDQSNRLWVAEATERSHYSSKMTSPAFRHLTGAALEVQKWSKLNSWNVDILQSDVEQLAVHFENPSSA